BEST AVAILABLE COW 




WORLD INTBUJBrrUAJLrap^TY ORGANIZATION 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) 



«: 

C12Q 1/68, G01N 33/543, 33/544, 
33*548,33066 



Al 



(U) 
(43) 



Publication Dmes 



WO 95/12689 

11 May 1995 (1UQ5J9S) 



(30) 



08/147,785 



PCT/US94/12768 
4 November 1994 (04.11.94) 



4 November 1993(04.11.93) US 



(60) Parent Application or Grant 
(63) Related by Continuation 
US 

Hied on 4 



(71) Applicant (far off designated States 
CHECK, INC. [US/US]; Route 1, 1 
NY 02989 (US). 



08/147,785 (OT) 
1993(04,11.93) 



(/Rfc GENE 
236, Verxnontvilte, 



(72) UTWWf 1 

(75) InvemWApplkani flbr USonfyti WAGNER, Robert, E* Jr. 
[US/US]; Route L f Box 236, Vermontvule, NY 02898 (US). 



(74) Agents: UVNAT. Stamue] et aL; Morrison A Taenia, 2000 
Pennsylvania Avenue, N.W„ Washington, DC 20006 (US). 



AM, AU, BB r BO, BR, BY, CA, CN, CZ. 
Fit OB, HU, JP, KG, KP, KR, KZ, LK, LT, LV. MD. MG. 
MN, NO. NZ, PL, RO, RU, SI, SK, TJ, TT, UA, US, UZ. 
VN, European patent (AT, BE, CH, DE» DK, ES, FR, GB, 
OR, IE| IT, LU» MC, NL, FT, SE), OAP1 patent (BP, BJ T 
CP, CO, CL» CM* OA, GN, ML, MR, NB, SN, TD, VQ\ 
ARIPO patent (KB, MW, 5D, SZ). 



WinA UaernatUmat search report* 



(54)TOic IMMOBILIZED MISMATCH BINDING PROTEIN FOR DETECTION OR PURIFICATION OF MUTATIONS OR 
POLYMORPHISMS 

(57)Abatn«t 

A method for detecting mntattoni, such as a single base change or an addition or deletion of about one to four base pairs, is based 
on the ose of an hronobuJaed DNA mi*roatrfc-binding protein, each as MntS, which binds to a track*: acid hybrid having a single base 
mismatch or unpaired base or bases, thereby allowing die detection of mutations involving as Hole as one base change in a nucleotide 
sequence. Such s method la nsefcJ for diagnosing a variety of important disease antra or rasceptibilities, 
onco g ene and far faoflatsn g or removing by affinity ci iron ia lo gtaphy duplex DNA motectites contoinh _ 
motecnles In FCR-ampllned DNA samples. Also provided are kits nsefol for practicing me methods of the 



FOR THE PURPOSES OF INFORMATION (WET 




WO 9^12689 



PCT/US94/12768 



IMMOBILIZED MISMMCT BINDING PROTEIN FOR DECTCTIOW 
OR PURIFICATION CP MUTATIONS « POLYMORPHISMS 



5 BXCKQROmm OF THE INVENTION 

FA^Ad of the amremriffli 

The invention in the fields of molecular biology 
and medicine relates to a method for detecting mutations 
involving as little as one base change or a single base 
10 addition to, or deletion from, the wild-type DNA sequence, 
as well as methods for removing mismatch- containing DNA 
from batches of amplified DNA. 

PcBmptiPH of the Bagfrqxwflfl ftrt 

Progress in human molecular and medical genetics 

15 depends on the efficient and accurate detection of 

mutations and sequence polymorphisms, the vast majority of 
which results from single base substitutions and small 
additions or deletions. Assays capable of detecting the 
presence of a particular mutation or mutant nucleic acid 

20 sequence in a sample are therefore of substantial 

importance in the prediction and diagnosis of disease, 
forensic medicine, epidemiology and public health. Such 
assays can be used, for example, to detect the presence of 
a mutant gene in an individual, allowing determination of 

25 the probability that the individual will suffer from a 
genetic disease* The ability to detect a mutation has 
taken on increasing importance in early detection of cancer 
or discovery of susceptibility to cancer with the discovery 
that discrete mutations in cellular oncogenes can result in 

30 activation of that oncogene leading to the transformation 
of that cell into a cancer cell (Nishimura, S. et aJ., 
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" Biochem. J. 243:313-327 (1987) ; Bos, J.L., Cancer Res. 
45:4682-4689 (1989)) . 

The desire to increase the utility and 
applicability of such assays is often frustrated by assay 
5 sensitivity as well as complexity and cost. Hence, it 

would be highly desirable to develop more sensitive as well 
as simple and relatively inexpensive assays for detection 
of alterations in DNA. 

Nucleic acid detection assays can be based on any 

10 of a number of characteristics of a nucleic acid molecule, 
Buch as its size, sequence, susceptibility to digestion by 
restriction endonucleaBes, etc. The sensitivity of such 
assays may be increased by altering the manner in which 
detection is reported or signaled to the observer. Thus, 

15 for example, assay sensitivity can be increased through the 
use of detect ably labeled reagents, wherein the labels may 
be enzymes (Kourilsky et al., U.S. Patent 4,581,333), 
radioisotopes (Falkow et al., U.S. Patent 4,358,535; 
Berninger, U.S. Patent 4,446,237), fluorescent labels 

20 (Albarella et al. # EP 144914), chemical labels (Sheldon III 
et al., U.S. Patent 4,582,789; Albarella et al . , U.S. 
Patent 4,563,417), modified bases (Miyoshi et al., EP 
119448), and the like. 

Most methods devised to attempt to detect genetic 

25 alterations consisting of one or a few bases involve 

hybridization between a standard nucleic acid {DNA or RNA) 
and a test DNA such that the mutation is revealed as a 
miepaired or unpaired base in a heteroduplex molecule. 
Detection of these miepaired or unpaired bases has been 

30 accomplished by a variety of methods. Mismatches have been 
detected by means of enzymes (RNaseA, MutY) which cut one 
or both strands of the duplex at the site of a mismatch 
(Myers, R.M. et al., Cold Spring Harbor Sywp. Quant. Biol. 
52:275-284 (1986); Gibbs, R, et al . , Science 235:303-305 

35 (1987); Lu, A.S. et al., 1992, Genomics 14:249-255 ■ 
(1992)). Duplexes without mismatches are not cut. By 
using radioactively labeled nucleic acid fragments to 
anneal to a test DNA, it is possible to use these enzymes 
to generate specific size fragments when a mutation is 
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present in the test DNA. The fragments are distinguished 
from uncut fragments by means of polyacrylamide gel 
electrophoresis. The major problems with these methods are 
that they require the use of RNA (RNase method) or have the 
5 ability to detect only a limited number of mismatches {MutY 
method) . 

Mismatch- containing DNA duplexes have also been 
distinguished from perfectly matched duplexes by means of 
denaturing gel electrophoresis. In this system, duplexes 

10 are run on a polyacrylamide gel in a denaturing gradient 
under conditions where mismatch- containing DNA denatures 
more readily than the identical duplex lacking a mismatch, 
such that the two kinds of duplexes migrate differently. 
This method, while sensitive and accurate, is extremely 

15 laborious and requires a high level of technical 
sophistication * 

Two other methods of mutation detection depend on 
the failure to extend or join fragments of DNA when 
mismatches are present. Both require the use of standard 

20 DNA oligonucleotides that end precisely at the site of the 
mutation in question such that, when annealed to test DNA, 
it is the last base of the oligonucleotide which is 
mismatched. Mismatch detection depends either on (a) the 
inability of DNA polymerase to extend an oligonucleotide 

25 with a mismatched terminal base or (b) the inability of DNA 
ligase to join two oligonucleotides when there is a 
mismatch at the joint between them. Fragment length is 
determined by gel electrophoresis. Presence of longer 
fragments than the input oligonucleotides indicates that a 

30 mismatch, i.e., mutation, was not present in the test DNA. 
These methods are also somewhat laborious, require that the 
exact location of the mutation be known and are difficult 
to interpret when the sample DNA is heterozygous for the 
mutation in question. Therefore, they are not practical 

35 for \ase in screening for polymorphisms* 

A chemical method for cleavage of mismatched DNA 
(Cotton, R.G* et al., Proc, Natl* Acad. Sc±. USA 
85:4397-4401 (1988); Cotton, R.G. , Wuc. Acids Res 
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27:4223-4233 (1969)) is based on chemical cleavage at a 
mismatch site in a DMA-DMA heteroduplex, using a number of 
agents, in particular osmium tetroxide and hydroxy 1 amine . 
In this method DMA probes are prepared by restriction 
5 enzyme cleavage of DNA of interest. Plasmid DMA containing 
the sequence of interest is hybridized to labeled probe DNA 
(either end-labeled or internally labeled with ^P) . 
Hydroxyl amine chemically modifies mismatched cytosines; 
osmium tetroxide modifies mismatched thymines. Piperidine 

10 is then used to cleave the DMA at the modified sites, 

followed by polyacryland.de gel electrophoresis (PAGE) and 
autoradiography to identify the cleavage products. This 
method is said to have the advantage of detecting all 
possible single base pair mismatches because , the method 

15 also results in cleavage at a matched base pair in the 
vicinity of a mismatch. 

Publications from Caskey's laboratory (Caskey, 
C.T. et al., European Patent Publication 333,465 (9/20/89); 
Grompe, M. et al., Proc. Natl. Acad. Sci. USA 0£:5BBB-5B92 

20 (1989)) disclose a method for localizing a mutation which 
utilizes PCR-amplified cDMA as a source of template for the 
mismatch cleavage reaction. This technique was 
successfully applied in studying ornithine 
trans carbamoyl as e (OTCase) deficiency patients to map 

25 mutations. 

Kung et al., U.S. Patent 4,963,658, discloses 
detection of single stranded DMA (ssDMA) by binding with a 
high* affinity ssDNA- binding protein r such as a 
topoisomerase or a DMA unwinding protein which itself can 
30 be bound to a label, such as 0-D-galactosidase. 

MISMATCH REPAIR SYSTEMS AND MISMATCH BINDING PROTEINS 

DNA mismatch repair systems employ a family of 
proteins including proteins which recognize and bind to 
mismatch* containing DNA, which are designated mismatch 
35 binding proteins (MBPs) • For reviews, see Radman, M. et 
al., Axxnu. Rev. Genet. 20:523-538 (1986); Radman, M. et 
al-, Sci. Amer., August 1968, pp. 40-46; Modrich, P., J** 
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Biol* Chein, 254:6597-6600 (1969)). The MutS protein was 
identified as such a component of the E. coli mismatch 
repair system. See, for example, Lahue, R.S. et al., 
Science 245:160-164 (1989); Jiricny, J« et al., Wucl . Acids 
5 Res, 2^:7843-7653 (1968); Su, S.S. et al., J, Biol. Cheizi. 
253:6829-6835 (1966); Lahue, R.S. et al . , Afutat. JZes. 
190:37-43 (1986); Dohet, C. et al,, Wol . Gen. Genet. 
205:161-184 (1987); and Jones, M. et al., Genetics 115:605- 
610 (1967) . Analogous proteins are known in other 

10 bacterial species including MutS in Salmonella typJiimurium 
(Lu, A.L. et al., Genetics 218:593-600 (1988); Haber L,T, 
et al., J. Bacterid. 170:197-202 (1986); Pang, P.P. et 
al*, J. Bacterid. 253:1007-1015 (1965)) and the hexA 
protein of Streptococcus pneumoniae (Priebe S.D. et al., 

15 i7. Bacteriol. 270:190-196 (1988); Haber et al., Bupra) . 

Purified MutS protein binds DNA containing 
mispaired bases, but does not bind DNA without mismatches 
or single -stranded DNA.. The MutS-DNA interaction does not 
result in any degradation or modification of the DNA. None 

20 of the above references disclose the possibility of using a 
MBP or immobilized MBP as part of a mutation detection 
assay or for purposes of removing mismatched DNA from 
amplified DNA samples. 



SUMMARY OF THE MTTIOH 

25 The present inventor has conceived of the use of 

immobilized mismatch binding protein (MBP) , for example, 
the MutS protein of B. coli, for (1) the detection of 
genetic mutations or genomic polymorphisms, (2) the 
purification of amplified DNA samples by removing 

30 contaminating sequences and sequences containing errors 

introduced during the amplification processes, and (3) the 
identification of specific alleles in mult i -allelic 
systems. 

The nucleic acid, preferably DNA, being analyzed 
35 can be obtained from any source, including blood cells, 



W09V12689 



PCT/USW/12768 



tumor tissues, cells in culture or any tissue, and can be 
obtained from any species including humans. The DNA may be 
labeled by any of a variety of well-known methods, uBing 
colorimetric , chemi luminescent or radioactive markers. In 
5 fact, it is not necessary to label test DNA at all. 

For detection of mutations or polymorphism, the 
assay can be performed with a labeled competing 
oligonucleotide. For purification of amplified DNA, no 
label is required. For allele identification, the label 
10 must be in a synthesized single -stranded oligonucleotide 
probe . 

The methods of the present invention depend on 
the creation of mismatches in the test DNA which are 
revealed by denaturing the test DNA and allowing it to 

15 reanneal. When testing for heterozygosity or for 

polymorphism within a test DNA sanple, the test DNA can 
b imply be self -annealed, resulting in formation of 
mismatches when the single strands reanneal with a strand 
descended from the other parental chromosome. If no 

20 heterozygosity exists, no mismatches will be formed, in 
this case, the label can be in the primers used for 
amplification or may be added to the termini of the test 
DNA if amplification is not required. 

A similar procedure and labeling scheme is used 

25 to remove sequences containing errors introduced during 
amplification of DNA or minority sequence species. In 
these cases, the material which does not bind to the MBP is 
recovered! and contains only those duplex sequences without 
mismatches. These sequences will therefore be greatly 

30 enriched for the majority sequence in the amplified 

population. When the starting material contains only one 
sequence, the unbound material will contain those sequences 
which are identical to the starting material, while those 
sequences containing errors introduced during araplif ication 

35 will, provided they are relatively rare, have formed 
mismatches which are retained by the immobilized HBP. 

To detect homozygous mutations, it in necessary 
to anneal the test DNA in the presence of known wild-type 
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sequences. Such sequences can be synthesized artificially 
or created during anplif ication by adding known wild-type 
sequences to the starting material before amplification. 
When annealing is performed in the presence of known wild- 
5 type sequences, the assay will detect both homozygous and 
heterozygous mutations. 

For allele identification, it is necessary to add 
a labeled single -stranded probe DNA to the test DNA after 
amplification. The probe sequence must be identical to the 

10 allele of interest such that no mismatches are formed when 
it anneals to DNA of that allele. The sequence of the 
probe is selected so that it forms mismatches when annealed 
to the DNA of any other allele. When test DNA is annealed 
to such a probe (with the test DNA in excess such that the 

15 probe sequences all anneal to form a duplex) and exposed to 
excess immobilized MBP, the presence of unbound label 
indicates that the allele in question is present in the 
test DNA sample* 

Thus, the present invention is directed to a 

20 method for detecting a mutation from .a non-mutated sequence 
of a target polynucleotide, preferably DNA, in a sample, 
comprising: 

(a) incubating a detect ably labeled polynucleotide or 

oligonucleotide from the sample with an 
25 immobilized mismatch-binding protein under 

conditions in which mismatch-containing 
polynucleotide molecules bind to the immobilized 
protein; and 

{b detecting the binding of any mismatch- 
30 containing polynucleotide from the sample to 

the mismatch-binding protein, 
whereby the presence of detectably labeled polynucleotide 
or oligonucleotide bound to the mismatch-binding protein is 
indicative of a mutation In the sequence of the target 
35 polynucleotide. 

Also provided is a method for detecting a 
mutation from a non-mutated sequence of a double stranded 
target mammalian polynucleotide in a sample, comprising: 
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(a) denaturing any double stranded polynucleotide in 
the sample followed by allowing DNA strands to 
reanneal; 

(b) incubating the denatured and reannealed double 

5 stranded nucleotide of step (a) with a mismatch- 

binding protein immobilized on a solid support, 
either 

(i) in the presence of a detect ably labeled 
mismatch- containing oligonucleotide capable 

10 of binding to the HBP; or 

(ii) wherein the MBP was preincubated with and 
allowed to bind a detectably labeled 
mismatch- containing oligonucleotide; and 

(c) detecting the amount of detectably labeled 

15 mismatch-containing oligonucleotide bound to the 

mismatch-binding protein 
whereby the presence of a mutation in the double stranded 
mammalian polynucleotide of the sample results in a 
decrease in the binding of the detectably labeled 
20 oligonucleotide to the mismatch -binding protein. 

A preferred MBP in the above methods is the E. 
coli MutS protein or a functional derivative thereof* 

Preferred solid supports on which the mismatch 
binding protein is immobilised include, but are not limited 
25 to, modified cellulose, polystyrene, polypropylene, 

polyethylene, dextran, nylon, polyacrylamide and agarose. 
A most preferred solid support is a nitrocellulose 
membrane , 

A preferred detectable label for the detectably 
30 labeled poly- or oligonucleotides in the above methods is 
biotin. 

Also provided is a method for the removal from an 
amplified SNA sample of minority sequences and sequences 
containing sequence errors introduced during the process of 
35 amplification, which method comprises: 

(a) subjecting the amplified DKA sample to conditions 

of denaturation followed by reanneal ing, such 
that minority sequence or error- containing 
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sequences form mismatch-containing DNA duplexes, 
thereby generating a mixture containing 
mismatched duplexes; 

(b) incubating the mixture of step (a) with an 

5 immobilized mismatch-binding protein so that 

mismatch-containing duplexes bind to the HBP; and 

(c) removing the immobilized MBP to which mismatch- 
containing DNA has bound from the amplified DNA 
sample, 

10 thereby removing the sequences containing sequence errors. 

In another embodiment is provided a method for 
identifying a specific allele in a nrulti-allelic system in 
a sample of amplified DNA, comprising: 

(a) mixing a detectably labeled oligonucleotide probe 
15 which is perfectly complementary to the DNA 

sequence of the specific allele with an excess of 
amplified test DNA under conditions of 
denaturation followed by annealing such that, 
after denaturing and annealing, every copy of the 
20 probe will be found in a duplex DNA; 

(b) incubating the mixture of step (a) with an excess 
of immobilized MBP such that all mismatch- 
containing DNA is retained on the immobilized 
MBP; 

25 (c) removing said immobilized MBP to which has bound 

any mismatch-containing DNA from the amplified 
test DNA; and 

(d) detecting the presence of the detectably labeled 
probe in the sample from which the immobilized 

30 MBP has been removed, 

wherein, the presence of labeled DNA in the sample 
indicates that the probe is perfectly complementary to an 
allele in the test DNA. 

In the above methods, the immobilized MBP may be 

35 (a) in a form which is removable by centrifugation, or (b) 
immobilized onto a column support material wherein the flow 
through material is devoid of mismatch -containing duplexes, 



WO 95/11689 



PCT/US94/12768 



- 10 - 

or (c) immobilized on a filter support such the filtrate is 
devoid of mismatch-containing duplexes* 

The present invention is also directed to a kit 
useful for detecting a mutation from a non-mutated sequence 
5 of a target polynucleotide sequence in a sample, the kit 
being adapted to receive therein one or more containers, 
the kit comprising: 

(a) a first container containing an immobilizable 

mismatch-binding protein; 
10 (b) a second container containing a solid support 

capable of immobilizing the HBP; and 
(c) a third container or a plurality of containers 

containing a reagent or reagents capable of 
detecting the binding of a detectably labeled 
15 mismatch- containing nucleic acid hybrid to the 

mismatch-binding protein. 

Also provided is a kit useful for detecting a 
mutation from a non-mutated sequence of a target 
polynucleotide sequence in a sample, the kit being adapted 
20 to receive therein one or more containers, the kit 
comprising : 

(a) a first container containing a mismatch -binding 
protein immobilized on a solid support; and 

(b) a second container or a plurality of containers 
25 containing a reagent or reagents capable of 

detecting the binding of a detectably labeled 
mismatch-containing nucleic acid hybrid to the 
mismatch- binding protein. 

In the above kits, the MBP is preferably MutS or 
30 a functional derivative thereof* The solid support is 

preferably selected from the group consisting of natural 
cellulose, modified cellulose, most preferably 
nitrocellulose, or polystyrene, polypropylene, 
polyethylene, dextran, nylon, polyacrylamide and agarose. 
35 Also provide is a mismatch binding protein, 
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preferably the MutS protein or a functional derivative 
thereof, immobilized on a solid support, such that said 
immobilized mismatch binding protein is capable of binding 
to mismatch- containing polynucleotide molecules. 

Figure 1 shows the results of a direct assay of 
mismatches using nitrocellulose -bound MutS* Increasing 
amounts of biotinylated mismatch- containing DNA (upper 2 
lines) or mismatch- free DNA (lower 2 lines) were added to 

10 the reaction mixtures. 

Figure 2 shows the results of a competition assay 
of mismatched duplexes using nitrocellulose -bound MutS 
protein. Increasing amounts of unlabeled mismatch- 
containing 30-mer (upper 2 lines) or mismatch-free 30-mer 

15 (lower 2 lines) were added to 5 ng biotinylated mismatch- 
containing 30-mer. The far right column on the figure 
represents wells which contained no MutS. 

Figure 3 shows the results of binding of mismatch 
containing DNA to E. coli mutS immobilized on 

20 nitrocellulose. Mismatch-containing DNA duplexes (2157 and 
Bio-Het+) show darker (or visible bands) at concentrations 
where homoduplex DNA shows lighter (or no) bands. 

Figure 4 shows the nucleotide sequences of 
synthetic oligonucleotides (30mers) prepared with a single 

25 mismatch at position 15 or 16 or with 1-4 unpaired bases 
between positions 15 and 16. Mismatched or unpaired bases 
are shown in boldface. Results of studies detecting these 
mismatches or unpaired bases are shown in Figure 5. 

Figure 5 shows the results of binding of DNA 

30 duplexes containing the indicated mismatches, or unpaired 
bases, to E. coli mutS immobilized on nitrocellulose. 

DMCRTPTIQMr OF THE PREFERRED EMBODIMENTS 

The present inventor conceived of a new, broadly 
applicable and relatively simple method for detecting a 
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single base change in a DNA sequence or several such base 
changes. This method is based upon the formation of a 
mismatch- containing heteroduplex when a strand of mutant 
DNA and a n complementary" strand of wild- type DNA 
5 hybridize. 

The presence of the mismatch is detected in a 
highly specific manner by first allowing the DNA to bind to 
an immobilized mismatch-binding protein (MBP) , such as the 
Nuts protein of E. coli. The presence of DNA bound to the 

10 MBP is then detected in any of a number of ways, depending 
on the label used and whether the assay is a direct assay 
or a competitive assay. This method stands in stark 
contrast to methods of the prior art which employ mismatch 
cutting nuclease enzymes capable of breaking DNA at or near 

15 a mispaired base pair. 

The methods described herein provide a 
mutation/polymorphism detection system having the 
advantages of (a) simplicity, (b) accuracy, (c) ability to 
be used without radioactivity, (d) ability to detect all 

20 single base substitution mutations and addition or deletion 
mutations of 1-4 bases. 

Standard reference works setting forth the 
general principles of recombinant DNA technology and cell 
biology, and describing conditions for isolation and 

25 handling of nucleic acids, denaturing and annealing nucleic 
acids, hybridization assays, and the like, include: 
Sambrook, J. et a!., MOLECULAR CLONING: A LABORATORY 
MANUAL, 2nd Edition, Cold Spring Harbor Press, Cold Spring 
Harbor, NY, 1989? Albere, B. et al., MOLECULAR BIOLOGY OF 

30 THE CELL, 2nd Ed., Garland Publishing, Inc., New York, NY, 
1989? Watson, J.D., et al., MOLECULAR BIOLOGY OF THE GENE, 
Volumes I and II, Ben j amin/Cummings Publishing Co., Inc., 
Menlo Park, CA, 1987; Darnell, J.E. et al., MOLECULAR CELL 
BIOLOGY, Scientific American Books, Inc., New York, NY, 

35 1986; Lewin, B.N., GENES II, John Wiley & Sons, New York, 
NY, 1985, which references are hereby incorporated by 
reference in their entirety. 
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KBPb are proteins of around 100 kDa, have been 
identified in and isolated from both bacteria and higher 
organiBtns and selectively bind DMA containing mismatched 
bases. KBPs have been found in yeast (Valle G et al* t 1991 
5 Yeast 7:9B1-9B8; Miret J.J* et al . , 1993, J. Biol Chem. 
2*0:3507-3513), as well as in humans (Stephenson, C. et 
al. f 1989, J. Biol* Chem* 2*4:21177-21782; Xarran, P et 
al., 1990, Wiitat. «Res. 23^:269-275; Hughes M.J. et al . , 
1992, J. Biol. Chen?. 267.23876-23882; Reenan, A.G. et al . , 
10 1993, Genetics 132:963-973; Reenan, A.G. et al., 1993, 

Genetics 132:975-985). Mismatch binding proteins from 
Xenopus and from mouse have been cloned by M. Radman and 
colleagues. 

A preferred MBP is characterized by its ability 

15 to bind DKA-DNA (or DNA-RNA or RNA-RNA) duplexes containing 
mispaired or unpaired bases, to the significant exclusion 
of single stranded polynucleotides or perfectly matched 
duplexes. In a preferred embodiment, the intact native 
MutS protein from E. coli is used. However, as uBed 

20 herein, the term "mismatch binding protein" or "MBP" is 
intended to encompass a functional derivative of the 
intact, native protein. By "functional derivative" is 
meant a "fragment," "variant , ■ "analog," or 
"chemical derivative" of the protein which retains the 

25 ability to bind to a mismatch-containing nucleic acid 

heteroduplex, which permits its utility in accordance with 
the present invention. 

A "fragment" of a MBP refers to any subset of the 
molecule, that is, a shorter peptide. A "variant" of the 

30 protein refers to a molecule substantially similar to 

either the entire protein or a DKA-hybrid-binding fragment 
thereof. A variant of a mismatch- binding protein, for 
example, of MutS, may be prepared by recombinant DNA 
methods well-known in the art. 

35 A preferred functional derivative of MutS is a 

homologue of E. coli MutS in another species, such as the 
MutS protein of Salmonella typhi murium (Lu, A.L. et al., 
supra} Haber L.T. et al., Bupra: Pang, P.P. et al., supra) 
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or the hexA protein of Streptococcus pneumoniae {Priebe 
S.D. et al w supra; Haber et al,, supra). In addition, 
possible eukaryotic homologues of MutS or HexA can also be 
used, such as those encoded by the homologous sequences 
5 identified in human, mouse, frog or hamster SNA (Shlmada, 
T. et al w J. Biol. Chezn. 254:20171 (1989); Linton, J. et 
al., Molec. Cell. Biol. 7:3058-3072 (1989); Pujii, H. et 
al., J. Biol. Chera. 264:10057 (1989)). 

A "chemical derivative" of the MBP contains 

10 additional chemical moieties not normally a part of the 

protein, including additional stretches of amino acids as 
in a fusion protein. Covalent modifications of the peptide 
are included within the scope of this invention. Such 
modifications may be introduced into the molecule by 

15 reacting targeted amino acid residues of the protein with 
an organic derivatizing agent that is capable of reacting 
with selected side chains or terminal residues. 

In selecting a protein as being a useful MBP for 
the methods of the present invention, assays can be 

20 performed by one of ordinary skill in the art using 

conventional methods. Thus, for example, in evaluating a 
sample for the presence of a MBP useful in the present 
invention, one can perform a mismatch binding assay, such 
as that described for MutS by Jiricny et al. (which 

25 reference is hereby incorporated by reference in its 

entirety} . Preferably, a filter binding assay is used. 
To prepare the oligonucleotide heteroduplex, an 
oligonucleotide, preferably of about 16 bases is labeled 
with ^P using a kinase reaction and -y-^P-ATP using a kinase 

30 such as T4 -polynucleotide kinase. The 5' -labeled 

oligonucleotide (which can be stored at -20 *c) is then 
annealed with a complementary oligonucleotide having a 
single base pair mismatch under standard conditions. The 
annealed 16 base pair heteroduplex is mixed with an excess 

35 of the protein being tested and kept on ice for 30 minutes. 
The mixture is then applied to a nitrocellulose filter 
which has been prewetted in the assay buffer. Gentle 
suction is applied for several seconds, and the filter is 
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washed extensively with ice-cold assay buffer. The filter 
is then dried in air, suspended in scintillation fluid and 
counted. By virtue of the protein sticking to the filter, 
any counts on the filter can be attributed to binding to 
5 the putative MBP. In the absence of such a protein, the 
labeled oligonucleotide heteroduplex will pass through the 
filter. Thus, by using such a simple assay, one can easily 
detect and select a MBP useful in the methods of the 
present invention. 

10 As used in the present invention, the MBP is 

immobilized to a solid support or carrier. By "solid 
support" or "carrier 11 is intended any support capable of 
binding a protein. Well-known supports or carriers include 
natural cellulose, modified cellulose such as 

15 nitrocellulose, polystyrene, polypropylene, polyethylene, 
dextran, nylon, polyacryl amide , and agarose or Sepharose®. 
Also useful are magnetic beads. The support material may 
have virtually any possible structural configuration so 
long as the immobilized MBP is capable of binding to the 

20 target nucleic acid molecule. Thus, the support configu- 
ration can include microparticles, beads, porous and 
impermeable strips and membranes, the interior surface of a 
reaction vessel such as test tubes and raicrotiter plates, 
and the like. Preferred supports include nitrocellulose 

25 disks or strips. Those skilled in the art will know many 
other suitable carriers for binding the MBP or will be able 
to ascertain these by routine experimentation. 

Most preferred is a solid support to which the 
MBP is attached or fixed by covalent or noncovalent bonds. 

30 Preferably, noncovalent attachment is by adsorption using 
methods that provide for a suitably stable and strong 
attachment. The MBP is immobilized using methods well* 
known in the art appropriate to the particular solid 
support, providing that the ability of the MBP to bind 

35 mismatch- containing DHA is not destroyed. 

The immobilized MBP is then easily used to detect 
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heterozygosity (or polymorphism) as well as single base 
mutations, or to isolate mismatch- containing DNA from a 
mixture, or to rid a mixture of mismatch-containing DNA. 

In one embodiment, the surface of polystyrene or 
5 other plastic multiwell plates serves aa the solid support. 
In another embodiment, a solid support to which the MBP is 
bound is affixed to the bottom of wells of multiwell 
plates . 

In a preferred embodiment, the immobilization and 
10 DNA binding can be performed in a 96 well blotting 

apparatus and the resulting sheet of nitrocellulose (or 

other support) paper can be removed to evaluate reactions. 

For example, color development on the nitrocellulose can be 

used to evaluate binding based on an enzyme as part of the 
15 detection system and a chromogenic or chemiluminescent 

substrate for the enzyme serving as the precursor of the 

color reactions. 

Following attachment of the MBP to the support , 

the support is treated ("blocked 11 ) to prevent further 
20 binding of proteins or nucleic acids, using methods and 

reagents well-known in the art. 

The immobilized MBP is contacted with and allowed 

to bind (to saturation) small oligonucleotide heteroduplex 

molecules. The oligonucleotides preferably have about 30 
25 base pairs. For testing, a DNA duplex containing a 

mismatch which is well recognized (i.e., bound) by the MBP 

is used. 

PREPARATION OF OLIGONUCLEOTIDES CONTAINING 
OR LACKING MISMATCHES 

30 Such oligonucleotides are prepared using a 

nucleotide modified at the 5' end with a detectable label 
such that they can be quantitatively detected by 
appropriate detection methods, preferably spectrophotometry 
or chemilumineecence . In a preferred embodiment, the 

35 oligonucleotide is biotin-modified r and is detectable using 
a detection system based on avidin or streptavidin which 
binds with high affinity to biotin. The streptavidin can 
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be conjugated to an enzyme, the presence of which is 
detected using a chromogenic substrate and measuring the 
color developed. 

Examples of useful enzymes in the methods of the 
5 present invention are horseradish peroxidase, alkaline 
phosphatase, glucose -6 -phosphate dehydrogenase, malate 
dehydrogenase, staphylococcal nuclease , del ta-V- steroid 
isomerase, yeast alcohol dehydrogenase, a-glycerophosphate 
dehydrogenase, triose phosphate isomerase, asparaginase, 

10 glucose oxidase, 0-galacto-sidase, ribonuclease , urease, 
catalase, glucoaraylase and acetylcholinesterase. 

The detectable label may also be a radioisotope 
which can be detected by such means as the use of a ga mma 
counter or a scintillation counter or by autoradiography. 

15 The detectable label may also be a fluorescent 

compound. When the fluorescently labeled molecule is 
exposed to light of the proper wave length, its presence 
can then be detected due to fluorescence using microscopy 
or fluorometry. Among the most commonly used fluorescent 

20 labelling compounds are fluorescein isothiocyanate, rhoda- 
mine, phycoerythrin , phycocyanin, allophycocyanin, £- 
phthaldehyde and fluorescamine. 

The detectable label may be a fluorescence 
emitting metal such as l52 E\i, or others of the lanthanide 

25 series. These metals can be attached to the 

oligonucleotide using metal chelating groups such as 
diethylenetriaminepentaacetic acid or ethyl enediaminetetra- 
acetic acid. 

The detectable label may be a chemiluminescent 

30 compound. The presence of the chemiluminescent -tagged 
molecule is then determined by detecting the presence of 
luminescence that arises during the course of a chemical 
reaction. Examples of particularly useful chemiluminescent 
labeling compounds are luminol, isoluminol, theromatic 

35 acridinium ester, imidazole, acridinium salt and oxalate 
ester. 

Likewise, a bioluminescent compound may be used 
to label the oligonucleotide. Bioluminescence is a type of 
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cherai luminescence found in biological systems in which a 
catalytic protein increases the efficiency of the 
chemiluminescent reaction. The presence of a biolumi- 
nescent protein is determined by detecting the presence of 
5 luminescence. Important bioluminescent compounds for 
purposes of labeling are luciferin, luciferase and 
aequorin. 

A MBP bound to the DNA- DNA, DNA-RNA or RNA-RNA 
hybrid can be detected either directly or indirectly. For 

10 direct detection, the poly- or oligonucleotide duplex is 
detectably labeled using labels as discussed herein „ 
For indirect detection, the assay utilizes 
competition of binding to the MBP of the test DNA with an 
already bound or a contemporaneously exposed mismatch- 

15 containing duplex. Thus, a labeled mismatch -containing 
oligonucleotide is pre-bound to the MBP or is incubated 
together with the MBP and test DNA. The more mismatch- 
containing DNA in the test sample, the less binding of the 
labeled oligonucleotide to the MBP will occur. 

20 The test sample to be assayed can be in any 

medium of interest, and will generally be a sample of 
medical, veterinary, environmental, nutritional, or 
industrial significance. Human and animal specimens and 
body fluids particularly can be assayed by the present 

25 method, providing that they contain cells from which 

nucleic acids can be prepared. Preferred sources include 
blood, sperm, other tissue, milk, urine, cerebrospinal 
fluid, sputum, fecal matter, lung aspirates, throat swabs, 
genital swabs and exudates, rectal swabs, and 

30 nasopharyngeal aspirates. 

To detect heterozygosity or polymorphism in DNA 
from a diploid organism, test DNA is preferably prepared by 
denaturing and annealing PCR-atnplif led DNA from a diploid 
35 organism. The test DNA is prepared with labeled primers, 

annealed and added to a well or other reaction vessel which 
contains immobilized MBP already bound to mismatched 
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oligonucleotides* Alternatively, test DNA can be mixed 
with mismatched oligonucleotides and the mixture added to a 
well or other vessel containing immobilized MBP 

A spectrophotometry reading is made at the 
5 wavelength appropriate for quantitative detection of the 
test DNA* After an incubation period suitable to allow 
either (1) binding of the test DNA to the immobilized HBP 
or (2) displacement of the mismatched oligonucleotide from 
the immobilized HBP, the DNA solution is removed, the well 

10 washed, and a spectrophotometry reading made at the 

wavelength appropriate for quantitative detection of the 
bound mismatched oligonucleotide. 

The ratio of the reading for test DNA to the 
reading for the mismatched oligonucleotide will be vastly 

15 different for mismatch -containing and mismatch-free test 
DNA over a wide range of DNA concentrations* Standard 
curves are prepared using known quantities of DNA to allow 
characterization of test DNA as homozygous or heterozygous 
without having to quantitate the test DNA prior to the 

20 ' assay. Thus, a single DNA sample is sufficient to 

determine heterozygosity and a single 96 well microplate 
will allow the testing of at least about 80 different DNA 
samples . 

DETECTION OF HOMOZYGOUS MUTATIONS 

25 To detect homozygous mutations, known homozygous 

wild-type DNA must be combined with the test DNA sample 
before denaturing and annealing. Only test DNA containing 
a mutation (homozygous) will form mismatch-containing DNA 
that can compete with the mismatched oligonucleotide for 

30 binding to the immobilized MBP. 

The method of the present invention can be used 
with (a) only the mismatched oligonucleotide labeled or (b) 
only the. test DNA labeled. However, both of these methods 
require that the nucleic acid concentration be determined 

35 and that the test be performed at several different test 
DNA concentrations. 

When the mismatched oligonucleotides are labeled, 
the test is based on competition with several different 
concentrations of test DNA and comparison of the resulting 
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curve (with concentration expressed as moles of duplex 
molecules) with standard curves for mismatched and non- 
mismatched standards. 

When the test DNA is labeled, the test involves 
5 measuring the extent of binding to the MBP of several 
concentrations below saturation and comparison of the 
resulting curve with standard curves for mismatched and 
non-mismatched standards, 

PURIFICATION OF AMPLIFIED DNA SAMPLES 

10 One of the most revolutionary and widely used 

technologies currently employed in modern molecular biology 
is the process of polymerase chain reaction (PGR) , which 
amplifies DNA sequences from starting amounts so minute as 
to be nearly undetectable. Por reviews of PGR, see: 

15 Mullis, K.B* , 1986, Cold Spring Harbor 9ymp. Quant. Biol. 
51:263-273; Saiki, R.K. etal., 1985, Bio/Technology 
3il008-1012; and Mullis, X.B, et al. f 1987, Afeth. Enzymol. 
155:335-350. In addition, because PCR can amplify specific 
sequences, it allows the purification of specific 

20 sequences, basically in a single step, from genomic DNA, 
PCR is an essential component of virtually all studies of 
the human genome, is a central component of gene 
identification and cloning, is increasingly used in the 
diagnosis of genetic and infectious diseases and is widely 

25 used in forensics. 

However, for some applications, in particular, 
gene cloning and mutation detection, PCR suffers from an 
inherent tendency of the polymerases to make mistakes by 
inserting incorrect, non- complementary bases during 

30 synthesis. Although the fidelity of most replicative 
polymerases in vivo, is such that they insert only one 
incorrect base for every 10 10 bases replicated, polymerases 
used in PCR can have error rates as high as one incorrect 
base for every 10 4 bases replicated. This high an error 

35 rate can mean that a significant fraction of the amplified 
molecules will not be identical in sequence with the 
starting material. 

The present methods are useful for purifying 
amplified DNA samples using an immobilized MBP to remove 
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minority sequences and molecules containing sequence 
alterations introduced by the amplification process. For 
example, if a DNA segment is amplified through 20 rounds of 
replication (a common amount of amplification) , a 
5 significant fraction of the final molecules may contain one 
or more incorrect bases* In cloning experiments, this 
greatly increases the risk of cloning a nucleotide sequence 
different from the starting sequence. 

In mutation detection assays involving 

10 denaturation and annealing of a PCR-amplif ied sample, 

incorrect bases inserted during PCR may be scored as if 
they were mutations in the original sample. Thus, for 
accurate mutation detection it is necessary to eliminate 
all DNA molecules with sequence alterations introduced by 

15 PCR copy errors* The method described here accomplishes 
this purification in a simple and straight -forward manner. 

Immobilized MBP can be used to purify amplified 
DNA samples. MBP is immobilized by binding to solid phase 
supports, preferably nitrocellulose filters, sepharose 

20 beads or magnetic beads* The filters or beads are treated, 
if necessary, to prevent the binding of double -stranded 
DNA. The amplified DNA sample is denatured, by heating, 
and allowed to reanneal. Given the random nature of PCR 
mistakes, virtually all incorrect bases will be found in 

25 mismatched base pairs after annealing* 

The immobilized MBP is added to the sample and 
the solution mixed by gentle shaking. The immobilized MBP, 
and any bound mismatch- containing DNA, is removed, for 
example, by removing the filter, by allowing the beads to 

30 settle out of solution or by removing the beads 

magnetically, depending on the nature of the solid support 
used. This leaves behind precisely matched DNA duplexes. 

In addition to purifying amplified DNA samples by 
removing molecules containing errors introduced during 

35 amplification, purification using an immobilized MBP is 
used to enrich for majority sequences when samples of 
diverged, repeated DNA sequences, such as of immunoglobulin 
genes, are being examined. 

To remove completely a minority species from a 

40 sample amplified from a mixed population of DNA (with 
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respect to sequence} , it may be necessary to perform more 
than one round of purification as described herein and 
possibly more than one round of amplification. 

Note that the present method can be used to 
5 purify sequences from both homozygous and heterozygous 
amplified sequences, since half of the parental sequences 
in a heterozygous sample will anneal to the complementary 
strand of the same parental heritage and thus form a 
molecule without mismatches. In other words, when the 

10 starting material is heterozygous, half of the annealed 
molecules will be removed from the sample because they 
contain a mismatch due to differences in the starting 
sequences. However, half the annealed molecules will not 
contain such mismatches and so will be removed from the 

15 sample only if they contain mismatches which were created 
as a result of errors during amplification. In any event, 
mutation detection assays will require a second round of 
denaturing and annealing. 

ftLfrfflifi IpEBTIFIcmOff W 1TOT,,TT-AU»*TJC SYSTEMS 

20 As more alleles of disease -causing genes are 

identified, and in the quest to develop a polymorphism map 
of the human genome, it is becoming increasingly important 
to be able to identify particular alleles of a given gene. 
Immobilized MBPs provide a simple and straightforward means 

25 of allele identification. 

Unique, labeled oligonucleotide probes are 
prepared for each allele of a given gene, such that the 
probe is perfectly complementary to only one allele, i.e., 
the probe will form one or more mismatches when paired with 

30 an incorrect allele. The probe is mixed with an excess of 
amplified test DHA such that, after denaturing and 
annealing, every copy of the probe will be found in duplex. 
The process is repeated with probes for every allele in 
question. The annealed DHA mixture is then either; 

35 (1) -mixed with immobilized MBP on a support that can 

be removed from the suspension by cent rifugat ion? 
(2) passed through a micro-column of immobilized MBP 

on an appropriate column support; or 
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(3) passed through a filter support containing 

immobilized MBP. 
In any case, the immobilized MBP must be in excess such 
that all mismatch -containing DNA is retained. The 
5 supernatant, column flow through or filtrate is analyzed 
for the presence of label. Only in those cases where the 
probe is perfectly complementary to an allele in the test 
DNA will label be detected. 

In order to be certain that no single- stranded 
10 probe sequences are present, it may be necessary, or at 
least desirable, to include some single -stranded DNA 
binding component on the support for the immobilized MBP. 
This system works equally well for homozygous or 
heterozygous conditions. 



15 JSIlfi 

The present invention is also directed to a kit 
or reagent system useful for practicing the methods 
described herein. Such a kit will contain a reagent 
combination comprising the essential elements required to 

20 conduct an assay according to the methods disclosed herein. 
The reagent system is presented in a commercially packaged 
form, as a composition or admixture where the compatibility 
of the reagents will allow, in a test device configuration, 
or more typically as a test kit, i.e., a packaged 

25 combination of one or more containers, devices, or the like 
holding the necessary reagents, and usually including 
written instructions for the performance of assays* The 
kit of the present invention may include any configurations 
and compositions for performing the various assay formats 

30 described herein. 

In all cases, the reagent system will comprise 
(1) an immobilizable or immobilized MBP or functional 
derivative, preferably mutS, and (2) additional reagents 
useful in carrying out the assay. The kit may optionally 

35 contain labeled mismatch- containing oligonucleotides. For 
detecting a particular mutation, a kit may also contain 
labelled primers for carrying out a PCR. A kit according 
to the present invention can additionally include ancillary 
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chemicals such as the components of the solution in which 
binding of duplexes to the immobilized MBP takes place. 



Having now generally described the invention, the 
same will be more readily understood through reference to 
5 the following examples which are provided by way of 

illustration, and are not intended to be limiting of the 
present invention, unless specified. 



BXNDXHG OF MISMATCH- CONTAINING DMA BY 
10 IMMOBILIZED MISMATCH BINDING PROTEIN 



A. Materials and Methods 

1* Preparation of immobilized MBP 
A nitrocellulose sheet {0.45pm, Schleicher & 
Schull) was wet with reaction buffer (20mM Tris pH 7.6, 
15 0.01 mM EDTA, 5 mM MgCl 2 , 0.1 mM DTT) said placed in a dot 
blot apparatus (Bio-Rad) . 

Purified MBP, E. coll MutS, at a concentration of 
0.5 /xg/10 fil reaction buffer was spotted on the 
nitrocellulose paper in each well* The wells were 
20 incubated at room temperature, and the remaining liquid was 
pulled through with vacuum. Each well was washed twice 
with 100 ill reaction buffer by adding the solution to the 
well and then pouring it out. After the second wash, the 
remaining solution was pulled through by vacuum. 

25 2. Blocking 

The nitrocellulose filter was blocked with bovine 
serum albumin (BSA) to prevent the binding of other 
proteins or nucleic acids. Reaction buffer (200 fxl) 
containing 1% (w/v) BSA was added to each well. After 1 

30 hour at room temperature, the solution was poured out and 
each well washed with 2 x lOOpl reaction buffer by adding 
the solution to the well and then pouring it out. After 
the second wash, the remaining liquid was pulled through 
with vacuum. 
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3 . 01 igonucleot ides 

The sequence of the oligonucleotides used in 
these studies was taken from the 3 0 base region surrounding 
the site of the sickle cell mutation in the human 0-globin 
5 gene* The mismatch is at the site of the sickle cell 
mutation, although the mutant sequence used to form the 
mismatch is not the sickle cell mutation (the actual sickle 
cell mutation is an A:T -» T:A transversion) , Biotinylated 
oligonucleotides were biotinylated on the 5' end of the 
10 mutant strand. Biotinylation is accomplished during 

synthesis by adding a biot in-modified nucleotide to the 5' 
end of the oligonucleotides. 

G:T Mismatch: 

Mutant GCACCTGACT CCTGQGGAGA AOTCTGCCGT [SEQ ID NO:l] 
15 Wild- type CGTGGACTGA GGACTCCTCT TCAGACGGCA [SEQ ID NO: 2] 

No Mismatch: 

Mutant GCACCTGACT CCTGGGGAGA AGTCTGCCGT [SEQ ID NO:l] 
Mutant CGTGGACTGA GGACCCCTCT TCAGACGGCA [SEQ ID NO: 3] 



20 4. Binding DNA 

Biotinylated oligonucleotides, in 20 pi 
reaction buffer containing 1% BSA, were added to each 
well. After 30 minutes at room temperature, remaining 
liquid was poured out. Each well was washed with 5 x 

25 100 ill reaction buffer by adding the solution to the 
well and then pouring it out. After the fifth wash, 
the remaining solution was pulled through by vacuum. 

5. Binding S t rep t a vi din- conjugated Horse 
Radish Peroxidase (HRP) 

30 The presence of biot in was detected by its 

binding of Streptavidin . A 100 pi volume of 
Streptavidin- conjugated HRP (Pierce Chemicals) at a 
concentration of 50 mg/ml in reaction buffer + 1% BSA 
was added to each well. After 2 hours at room 

35 temperature, the solution was poured out and each well 
washed with 5 x 100 pi reaction buffer by adding the 
solution and then pouring it out* After the fifth 
wash, the remaining solution was pulled through with 
vacuum. 
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6. Enhanced ChemiLuminescence® (ECL) 
Development The nitrocellulose sheet was removed 
from dot blot apparatus and washed 3 times in a petri 
dish with 10 ml reaction buffer. Five ml ECL 

5 development solution {Amersham) was poured over the 

nitrocellulose. The substrate for HHP in this reagent 
is a chemiluminescent compound. After 1 minute, the 
solution was removed. The nitrocellulose was blotted 
dry and placed between 2 clear plastic sheets. The 
10 nitrocellulose thus protected was exposed to X-ray 

film in the dark for varying periods of time. In the 
experiments reported here, the exposure time was 1 
minute . 

7 . Competition 

15 In competition studies, the DNA binding was as 

described above, except that a constant amount of 
biotinylated mismatch- containing oligonucleotide (5ng) 
was mixed with varying amounts of unlabeled DNA, with 
or without a mismatch, and added to the wells. 

20 B. RESULTS 

1. Specific binding of mismatch-containing 
DNA 

by immobilized MBP 
Figure l shows the results (in duplicate) of 
25 adding increasing amounts of biotinylated mismatch- 
containing DNA (upper 2 lines) or mismatch- free DNA 
(lower 2 lines) . 

The immobilized MBP can detect as little as 
0.2 ng of mismatch-containing 30-mer, whereas no 
30 detectable binding of mismatch- free 30-mer was 

observed even with 200 ng of DNA. (The lines around 
the lower spots were artifacts of incomplete washing.) 

2. Competition Assay 

Figure 2 shows the results (in duplicate) of 
35 adding increasing amounts of unlabeled mismatch- 
containing 30-mer (upper 2 lines) or mismatch- free 30- 
mer (lower 2 lines) to 5 ng biotinylated mismatch- 
containing 30-mer, Although competition was clearly 
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visible with 50 ng of mismatch-containing DNA, the 
mismatch- free DNA did not compete until 500 ng, if at 
all. The far right column on the figure contains no 
MBP. 

5 The results indicate that, at least with the 

3 Outers used above, immobilized MBP discriminates 
between mismatch-containing and perfectly paired DNA 
with an efficiency of at least three orders of 
magnitude. Similar results have been obtained using 

10 54mers with a sequence derived from the V3 loop of 

HIV. Therefore, even if the discrimination decreases 
as the amount of perfectly paired duplex increases, 
the discrimination efficiency when using 300mers, 
considered to be the maximum useful length for 

15 polymorphism studies of the human genome, should be on 
the order of a factor of 100* 

EXAMPLE XX 

HETEHO ZYGOTE DETECTION IN HUMAN GENOMIC DNA 

The methods described above were used to 
20 detect heterozygosity at a specific location in human 
genomic DNA. PCR amplification was performed on a 
portion of exon 3 of the human glucokinase gene 
extending from codon 98 {encoding glut amine) to the 
stop codon {Soffel et al* f Prvc. Natl* Acad. Sci, USA 
25 89:7698-7702 (1992)). The wild-type double stranded 
sequence of 100 bases (SEQ ID NO: 4 and SEQ ID NO: 5) 
corresponding to exon 3 human glucokinase is as 
follows : 

5 ' gcactaacttcagggtgatgctggtgaaggtgggagaagg 
30 3 ' cgtgattgaagt cccactacgaccacttccaccctcttcc 

tgaggaggggcaatggagcgtgaagaccaaacaccagatg 
actcctccccstsacctcgcacttctggtttgtggtctac 

tactccatccccgaggacgcc 3' (SEQ ID NO: 4) 
atgaggtaggggctcctgcgg 5' {SEQ ID NO: 5) 

35 In the heterozygous DNA (see below) The C of 

the underlined GAG codon was mutated to T. The DNA 
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sequences tested were PGR amplified from genomic DNA 
obtained from: (l) a known heterozygote at that 
position, (2) a known homozygote at that position and 

(3) a presumed homozygote at that position. 

Test DNAb were denatured, by heating, allowed 
to reaimeal and tested for the presence of mismatches 

(I.e., heterozygote s) by testing their binding in an 
immobilized mismatch binding protein assay according 
to the present invention, utilizing E. coll MutS. 

A. Materials and Methods: 
1. PGR Amplification 

The following templates were used: 

(a) 2157 - heterozygote at glucokinase gene exon 3 

(b) DGK-101 - Human genomic DNA (homozygote at 
glucokinase gene exon 3) 

(c) Human genomic DNA, male (presumed homozygote 
at glucokinase gene exon 3) designated Sigma 
DNA {commercially obtained from Sigma Chemical 
Co,) 

The primers {obtained from Operon) were HPLC 
purified and had the following sequences, 
corresponding to the 5' termini of the two DNA strands 
SBQ ID N0;4 and SEQ ID NO:5: 

Primer #1: 5' {bio tin) -GCACTAACTTCAGGGTGATG 
Primer #2: 5' - GCGTCCTCGGGGATGGAGTA 
PCR Primer #1 contained biotin bound at the 5' 
end to allow detection in the ECL detection system 
described below. Primer #2 was radioactively labeled 
with a 5- ' 32 P -phosphate to allow quantitation of the 
different amplification products after removal of 
unused primers. Primer #2 was 32 P labeled using a 
kinase reaction prior to use in amplification. The 
kinase reaction mix contained: 70mM Tris HC1, pH 7.G; 
10 raM mgCl 2 ; 5mM DTT; 20/iCi ^-ATP; 30 units T4 
polynucleotide kinase; and 500ng DNA {primer #2). The 
kinase reaction was performed in a 20/il reaction 
volume at 37°C for 30 min. Kinase was inactivated by 
heating at 70 *C for 10 min. DNA was stored at -20 °C. 
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The PCR reaction included: lOmM Trie HCl pH 
8.3; 50 mM KC1; l.SiriM MgCl 2 ; 0.001* gelatin (w/v) ; 
0.05mM dATP; 0.05mM dTTP; O.OBmM dOTP; 0,05trtM dCTP; 
O.lpM primer #1; 0.075/iM primer #2; 0.025/iM 32 P primer 
#2; 200ng template DNA; and 2*5 units AmpliTaq DNA 
polymerase ( Per kin- Elmer ) . Reaction volume was lOOpl, 
Amplification was carried out for 30 cycles in a 
Perkin-Elmer thermocycler by denaturing at 90 °C for 1 
man., annealing at 55°C for 1 min. and extension at 
72 °C 2 min. Unused primers were removed by 
centrifugal dialysis using Centricon 30 
microconcentrators (Amicon) according to 
manufacturer's protocol. PCR products were 
quantitated by running equal amounts (measured as cpm 
of radioactivity) on non- denaturing 8% polyacrylamide 
gels, staining with ethidium bromide and comparing 
them to standard DNA, . 

2. Immobilized Mismatch Binding Protein 

Assays : 

DNA was denatured and annealed (in Perkin- 
Elmer thermocycler) according to the following 
schedule: 100°C for 4 min; 50°C for 1 hour; 75°C for 4 
min; 50 °C for 30 min, followed by cooling to room 
temperature . 

A nitrocellulose sheet (0.45Mm, Schleicher and 
Schuell, BA65) was wet by floating in reaction buffer 
(20mM Tris HCl, pH 7*6; 5raM MgCl 2 ; O.lmM DTT; O.OlmM 
EDTA) and placed in a slot blot apparatus (Hoef er 
Scientific Instruments) over 3 sheets of blotting 
paper (Schleicher and Schuell OB002) . lOOpl of 
reaction buffer was added to each well. After 5 min 
at room temperature, any remaining buffer was pulled 
through by vacuum, MutS.(500 ng in 20/il reaction 
buffer) was added to each well. The same volume of 
reaction buffer was added to "No MutS 0 wells . The 
apparatus was kept at room temperature for 20 min 
before proceeding to the next step. 

The nitrocellulose was blocked by adding 200^1 
of HRP-free BSA to each well. After 1 hr at room 



- 30 - 

temperature, any remaining solution was pulled through 
by vacuum, 

DNA preparations were added in 20/il reaction 
buffer containing 3* HRP-free BSA. After 30 min at 
room temperature, wells were washed 5 times with lOO/il 
reaction buffer by adding the solution to the wells 
and decanting it* 

The presence of biot in -labeled DNA. bound on 
the nitrocellulose sheet was detected by visualizing 
the binding of streptavidin to biot in. 100 fil 
streptavidin-HRP in reaction buffer containing 3% HRP- 
free BSA was added to each well. After 20 min at room 
temperature, any remaining solution was decanted* 
Wells were washed 5 times with 100/xl reaction buffer 
by adding solution to each well and then decanting. 
Any solution remaining, after the fifth wash was 
removed by vacuum. 

3. Enhanced Chemiluminescence (ECL) 
development 

The nitrocellulose sheet was removed from the 
apparatus and washed 4 times for 1 min with 50 ml 
reaction buffer in a small tray. Nitrocellulose was 
blotted dry and immersed in 10 ml of ECL development 
solution (Amersham) . After 1 min, the nitrocellulose 
sheet was removed, blotted dry and placed between two 
clear plastic sheets. The nitrocellulose thus 
protected was exposed to X-ray film in the dark for 30 
sec. 

B , RESULTS 

The results are shown in Figure 3* Bio-Het 
refers to a synthetic 30mer duplex of SBQ ID NO:l and 
SEQ ID NO: 3* This duplex contains a G;T mismatch at 
position 15. Bio-Homo refers to a synthetic 30mer 
duplex of SEQ ID NO:l and SEQ ID NO:2, which lacks the 
mismatch at position 15. No binding of Bio-Homo was 
detected even at lOng of DNA whereas Bio-Het was 
detected at amounts as low as O.lng. No binding was 
observed in the absence of MutS ( n no MutS* columns) 
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* 

indicating that all DNA binding observed ie MutS 
dependent . 

Binding of heterozygote nucleic acid (2157) to 
mutS was clearly visible at 0.6ng. Homozygote binding 
5 (independent of the source of the DNA} was faintly 
detectable at l,25ng and was clearly detected at 
2.5ng. Thus, heterozygote DNA was detected at least 
2-4 times better than homozygote DNA in this assay* 
The binding of homozygote DNA at higher concentrations 

10 was considered to be the result of errors introduced 
during amplification by the Tag polymerase. Such a 
high error rate in nucleotide incorporation by this 
polymerase is a well-known phenomenon. Thus, such 
inappropriate binding still represented mismatch 

15 binding by the immobilized MutS. 

The use of immobilized mismatch binding 
protein assays for mutation, heterozygosity or 
polymorphism detection are limited only by the ability 
to provide substrates free of random mismatches, such 

20 as those produced by polymerase errors during PCR 
amplification. 



EXAMPLE III 

DETECTION OF SPECIFIC MISMATCHES AND UNPAIRED BASES 
BY IMMOBILIZED MISMATCH BINDING PROTEIN 

25 Studies were performed to detect single base 

mismatches and one to four unpaired bases in 
heteroduplex DNA. These studies utilized synthetic 
oligonucleotides (30mers, which include SEQ ID NO:l r 
SEQ ID NO:2 and SEQ ID NO:3) prepared with a single 

30 mismatch at position 15 or 16 or with 1-4 unpaired 
bases between positions 15 and 16. The sequences of 
these oligonucleotides are shown in Figure 4. As 
described above, these sequences were taken from, or 
are closely related to, the human 0-globin gene in the 

35 region of the mutation responsible for sickle-cell 

anemia. The oligonucleotides were prepared with a 5' 
biotin label (to allow detection by the ECL detection 
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system) and were annealed to unlabeled 
oligonucleotides such that only one strand of each 
duplex was labeled* The duplexes were used in a 
mismatch detection assay using immobilized mismatch 
binding protein (MutS) , essentially as described above 
in the previous Examples* 

Oligonucleotides were diluted in TNE buffer 
{lOmM Tris HC1, pH 8.0; 0.01M NaCl, ImM EDTA) . Biotin 
labeled oligonucleotides were diluted to 10 mg/pl and 
unlabeled oligonucleotides to 100 ng/pl. Equal 
volumes of diluted oligonucleotides were mixed and 
annealed at 70°C for 10 min, room temperature for 30 
min, followed by quenching on ice and were stored at 
-20°C. The unlabeled oligonucleotides were in 10-fold 
excess to assure that all biotin- labeled strands were 
in duplexes. 

The results are presented in Figure 5. All 
mismatches except C:C and G: A were detected in these 
assays. T:C and C:A mismatches were not teeeted in 
this experiment. Not all detected mismatches are 
detected equally well. The order of detection 
sensitivity appeared to be 

G:T > G:G > C:T > AsC > T;T > A:A - AsG > C:C* 

The fact that A:G mismatches were better 
detected than G:A mismatches suggested that the 
sequence of the individual strands may influence the 
extent of mismatch detection at least in relatively 
short oligonucleotides such as those used here. 
However, G:T and T:G mismatches were equally well- 
detected, suggesting that well-detected mismatches are 
detected independent of strain orientation. 

Heteroduplexes with 1*4 unpaired bases were 
also detected. A heteroduplex with two unpaired bases 
was more easily detected than a heteroduplex with one 
unpaired base. Heteroduplexes with three unpaired 
bases were less well detected than any of the 
detectable mismatches/ and heteroduplexes with four 
unpaired bases were even less well detected. 

These results indicate that immobilize 
mismatch binding protein assays will detect all 



WO 95/12689 



PCI7US94/12768 



- 33 - 

mutations due to a single base change. Thus, although 
C:C went undetected, the corresponding G:0 mismatch, 
which would of necessity occur in any mutant: wild type 
pairing that had a C:C mismatch, was easily detected. 
5 In addition, the detection system can easily detect 
mutations arising from the addition of 1*3 bases. 

The references cited above are all 
incorporated by reference herein, whether specifically 
incorporated or not. 

10 Having now fully described this invention, it 

will be appreciated by those skilled in the art that 
the same can be performed within a wide range of 
equivalent parameters, concentrations, and conditions 
without departing from the spirit and scope of the 

15 invention and without undue experimentation. 

While this invention has been described in 
connection with specific embodiments thereof, it will 
be understood that it is capable of further 
modifications. This application is intended to cover 

20 any variations, uses, or adaptations of the inventions 
following, in general, the principles of the invention 
and including such departures from the present 
disclosure as come within known or customary practice 
within the art to which the invention pertains and as 

25 may be applied to the essential features hereinbefore 
set forth as follows in the scope of the appended 
claims. 
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WHAT IS CLAIMED 

1, A method for detecting a mutation from a 
non-mutated sequence of a target polynucleotide in a 
sample , comprising : 

(a) incubating a detectably labeled 

polynucleotide or oligonucleotide from 
said sample with a mismatch-binding 
protein immobilized on a solid support, 
under conditions in which mismatch- 
containing polynucleotide molecules bind 
to said immobilized protein; and 
(b detecting the binding of any mismatch- 
containing polynucleotide from said 
sample to said mismatch-binding protein, 
whereby the presence of detectably labeled 
polynucleotide or oligonucleotide bound to the 
mismatch -binding protein is indicative of a mutation 
in the sequence of the target polynucleotide. 

2. A method according to claim 1 wherein 
said mismatch-binding protein is the MutS protein or a 
functional derivative thereof. 

3 * A method according to claim 1 wherein 
said solid support is selected from the group 
consisting of natural cellulose , modified cellulose, 
polystyrene , polypropylene , polyethylene , dextran , 
nylon, poly acryl amide and agarose* 

4. A method according to claim 3, wherein 
said solid support is a nitrocellulose membrane* 

5. A method according to claim 1 wherein 
said detectably labeled polynucleotide or 
oligonucleotide is labeled with a chromogenic 
compound, a chemiluminescent compound, a 
bioluminescent compound, a fluorescent compound or a 
radiolabel » 
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6. A method according to claim l wherein 
said detectably labeled polynucleotide or 
oligonucleotide is labeled with biotin. 

7, A method according to claim 1 wherein 
5 said target polynucleotide la DMA. 

a, A method for detecting a mutation from a 
non-mutated sequence of a double stranded target 
mammalian polynucleotide in a sample, comprising: 

(a) denaturing any double stranded 

10 polynucleotide in said sample into single 

strands and allowing said single strands 
to reanneal into duplexes? 

(b) incubating the denatured and reannealed 
duplexes of step (a) with a mismatch- 

15 binding protein immobilized on a solid 

support , either 

(i) in the presence of a detectably 
labeled mismatch-containing 
oligonucleotide capable of binding 

20 to said mismatch binding protein; or 

(ii) wherein said a mismatch -binding 
protein was preincubated with and 
allowed to bind a detectably labeled 
mismatch-containing oligonucleotide ; 

25 and 

(c) detecting the amount of detectably 
labeled mismatch- containing 
oligonucleotide bound to said mismatch- 
binding protein, 

30 whereby the presence of a mutation in said double 
stranded mammalian polynucleotide of the sample 
results in a decrease in the binding of the detectably 
labeled oligonucleotide to the mismatch- binding 
protein * 
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9. A method according to claim 8 wherein 
said solid support ia selected from the group 
consisting of natural cellulose , modified cellulose, 
polystyrene, polypropylene, polyethylene, dextran, 
nylon, polyacrylamide and agarose. 

10. A method according to claim 9, wherein 
said solid support is a nitrocellulose membrane. 

11. A method for the removal from an 
amplified DNA sample of minority sequences and 
sequences containing sequence errors introduced during 
the process of amplification, which method comprises: 

(a) subjecting said amplified DNA sample to 
conditions of denaturation followed by 
reannealing, such that minority sequence or 
error -containing sequences form mismatch- 
containing DNA duplexes, thereby generating a 
mixture containing mismatched duplexes; 

(b) incubating the mixture of step (a) with an 
immobilized mismatch-binding protein so that 
mismatched duplexes bind to said mismatch 
binding protein; and 

(c) removing said immobilized mismatch binding 
protein to which mismatch-containing DNA has 
bound from said amplified DNA sample, 

thereby removing said sequences containing sequence 
errors . 
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12. A method for identifying a specific 
allele in a multi- allelic system in a sample of 
amplified DNA, comprising: 

(a) mixing a detectably labeled oligonucleotide 

5 probe which is perfectly complementary to the 

DNA sequence of said specific allele with an 
excess of amplified test DNA under conditions 
of denaturation followed by annealing such 
that, after denaturing and annealing , every 
10 copy of the probe is in a duplex DNA; 

(b) incubating the mixture formed in step (a) with 
an excess of immobilized mismatch binding 
protein, such that all mismatch-containing DNA 
is retained on said immobilized mismatch 

15 binding protein; 

(c) removing said immobilized mismatch binding 
protein to which has bound any mismatch- 
containing DNA from said amplified test DNA; 
and 

20 (d) detecting the presence of said detectably 

labeled probe in said sample from which said 
immobilized mismatch binding protein has been 
removed, 

wherein, the presence of labeled DNA in said sample 
25 indicates that the probe is perfectly complementary to 
an allele in the test DNA. 

13. A kit useful for detecting a mutation 
from a non-mutated sequence of a target polynucleotide 
sequence in a sample, said kit being adapted to 

30 receive therein one or more containers, said kit 
comprising: 

(a) a first container containing an 
imraobilizable mismatch-binding protein; 

(b) a second container containing a solid 
35 support capable of immobilizing said 

mismatch binding protein; and 

(c) a third container or a plurality of 
containers containing a reagent or 
reagents capable of detecting the binding 
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of a detectably labeled mismatch- 
containing nucleic acid duplex to said 
mismatch-binding protein. 

14. A kit useful for detecting a mutation 
from a non-mutated sequence of a target polynucleotide 
sequence in a sample, said kit being adapted to 
receive therein one or more containers, said kit 
comprising: 

(a) a first container containing a mismatch- 
binding protein immobilized on a solid 
support ; 

(b) a second container or a plurality of 
containers containing a reagent or 
reagents capable of detecting the binding 
of a detectably labeled mismatch- 
containing nucleic acid duplex to said 
mismatch- binding protein* 

15* A kit according to claim 13 wherein said 
mismatch-binding protein is MutS or a functional 
derivative thereof. 

16 . A kit according to claim 14 wherein said 
mismatch -binding protein is MutS or a functional 
derivative thereof* 

17. A kit according to claim 13 wherein said 
solid support is selected from the group consisting of 
natural cellulose, modified cellulose, polystyrene, 
polypropylene, polyethylene, dextran, nylon, 
polyacrylamide and agarose. 

18. A kit according to claim 17 wherein said 
solid support is a nitrocellulose membrane. 
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19. A kit according to claim 14 wherein said 
solid support is selected from the group consisting of 
natural cellulose, modified cellulose, polystyrene, 
polypropylene, polyethylene, dextran, nylon, 

5 polyacrylamide and agarose. 

20. A kit according to claim 19 wherein said 
solid support is a nitrocellulose membrane. 

21. A mismatch-binding protein immobilized on 
a solid support, such that said immobilized mismatch 

10 binding protein is capable of binding to mismatch- 
containing polynucleotide molecules* 

22. The immobilized mismatch-binding protein 
of claim 21 which is the MutS protein or a functional 
derivative thereof. 
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Type MUslflfltidfi Sequence seo id nq ; 

Homoduplex: GCACCTGACTCCTGGGGAGAAGTCTGCCGT 1 

aSTGGACTGAGGACCCCTCTTCAGACGOCA 3 

Q : T Mismatch: GCACCTGACTCCTGGGGAGAAGTCTGCCGT 1 

CGTGGACTGAGGACTCCTCTTCAGACGGCA 2 

T:G Mismatch GCACCT OACTC CTGTGGAGAAGTCTQCCGT 6 

CGTGGACTGAGGACGCCTCTTCAGACGGCA 7 

C:T Mismatch GCACCTGACTCCTGCGGAGAAGTCTGCCGT 6 

CGTGGACTGAGGACTCCTCTTCAGACGGCA 2 

G:G Mismatch GCACCTGACTCCTGGGGAGAAGTCTGCCGT 9 

CGTGGACTGAGGACGCCTCTTCAGACGGCA 10 

A:G Mismatch GCACCTGACTCCTGAGGAGAAGTCTGCCGT 11 

CGTGGACTGAGGACGCCTCTTCAGACGGCA 12 

G: A Mismatch GCACCTGACTCCTGGGGAGAAGTCTGCCGT 1 

COTGGACTGAGGACACCTCTTCAGACGGCA 13 

A:C Mismatch GCACCTGACTCCTGAGGAGAAGTCTGCCGT 14 

CGTGGACTGAGGACCCCTCTTCAGACGGCA 3 

A: A Mismatch GCACCTGACTCCTGAGGAGAAGTCTGCCGT 15 

CGTGGACTGA GGACACCTCTTCAGACGGCA 16 

T:T Mismatch GCACCTGACTCCTGTGGAGAAGTCTG CCGT 17 

CGTGGACTGAGGACTCCTCTTCAGACGGCA 2 

C:C Mismatch GCACCTGACTCCTGGCGAGAAGTCTGCCGT 16 

CGTGGACTGAGGACCCCTCTTCAGACGGCA 3 

Unpaired C GCACCTGACT CCTGGCGGAGAAGTCTGCCGT 19 

CGTGGACTGAGGACC CCTCTTCAGACGGCA 3 

Unpaired CA GCACCTGACT CCTGGCAGGAGAAGTCTGCCGT 20 

CGTGGACTGAGGACC CCTCTTCAGACGGCA 3 

Unpaired CAG GCACCTGACTCCTGGCAGGGAGAAGTCTGCCGT 21 

CGTGGACTGAGGACC CCTCTTCAGACGGCA 3 

Unpaired CAGG GCACCTGACTCCTGGCAGGGGAGAAGTCTGCCGT 22 

CGTGGACTGAGGACC CCTCTTCAGACGGCA 3 

FIG. 4 
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